Dependence-Maximization Clustering with Least-Squares Mutual Information
نویسندگان
چکیده
Recently, statistical dependence measures such as mutual information and kernelized covariance have been successfully applied to clustering, called dependencemaximization clustering. In this paper, we propose a novel dependencemaximization clustering method based on an estimator of a squared-loss variant of mutual information called least-squares mutual information. A notable advantage of the proposed method over existing ones is that hyperparameters such as kernel parameters and regularization parameters can be objectively optimized based on cross-validation. Thus, subjective manual-tuning of hyperparameters is not necessary in the proposed method, which is a highly useful property in unsupervised clustering scenarios. Through experiments, we illustrate the usefulness of the proposed approach.
منابع مشابه
Direct Approximation of Quadratic Mutual Information and Its Application to Dependence-Maximization Clustering
Mutual information (MI) is a standard measure of statistical dependence of random variables. However, due to the log function and the ratio of probability densities included in MI, it is sensitive to outliers. On the other hand, the L2-distance variant of MI called quadratic MI (QMI) tends to be robust against outliers because QMI is just the integral of the squared difference between the joint...
متن کاملSuffcient Component Analysis
The purpose of sufficient dimension reduction (SDR) is to find a low-dimensional expression of input features that is sufficient for predicting output values. In this paper, we propose a novel distribution-free SDR method called sufficient component analysis (SCA), which is computationally more efficient than existing methods. In our method, a solution is computed by iteratively performing depe...
متن کاملComputationally Efficient Sufficient Dimension Reduction via Squared-Loss Mutual Information
The purpose of sufficient dimension reduction (SDR) is to find a low-dimensional expression of input features that is sufficient for predicting output values. In this paper, we propose a novel distribution-free SDR method called sufficient component analysis (SCA), which is computationally more efficient than existing methods. In our method, a solution is computed by iteratively performing depe...
متن کاملCanonical dependency analysis based on squared-loss mutual information
Canonical correlation analysis (CCA) is a classical dimensionality reduction technique for two sets of variables that iteratively finds projection directions with maximum correlation. Although CCA is still in vital use in many practical application areas, recent real-world data often contain more complicated nonlinear correlations that cannot be properly captured by classical CCA. In this paper...
متن کاملThe Factors Affecting Partnership Quality of Information Systems Outsourcing of PACS for Hospital
The purpose of this research is to investigate the determinants of outsourcing partnership between clients and providers in outsourcing projects. Furthermore, the relationship between the quality of outsourcing partnership and the success of outsourcing was also investigated in this research. Survey methodology was utilized, and the subjects were the medical centers that involved in PACS projec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JACIII
دوره 15 شماره
صفحات -
تاریخ انتشار 2011